An on-line acoustic compensation technique for robust speech recognition
نویسنده
چکیده
In this work we report on the use of an on-line acoustic compensation technique for robust speech recognition. With this technique acoustic mismatch between training and actual conditions is reduced through acoustic mapping. At recognition stage, observation vectors delivered by the acoustic front-end are mapped into a reference acoustic space, while input data are exploited to update the statistical parameters of the mapping. Experimental results, obtained for matched and unmatched training and testing environment conditions, show that the investigated technique tangibly improves the performance of a speaker independent speech recognizer based on hidden Markov models. Furthermore, recognition results are close to those obtained with unsupervised incremental model adaptation based on maximum likelihood linear regression.
منابع مشابه
Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کاملAn Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition
Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...
متن کاملFeature vector normalization with combined standard and throat microphones for robust ASR
We propose on-line unsupervised compensation technique for robust speech recognition that combines standard and throat microphone feature vectors. The solution, called MultiEnvironment Model-based LInear Normalization with Throat microphone information, MEMLINT, is an extension of MEMLIN formulation. Hence, standard microphone noisy space and throat microphone space are modelled as GMMs and a s...
متن کاملRobust telephone speech recognition based on channel compensation
Channel compensation technique has been proved to be an e!ective approach for robust speech recognition. In this paper, we compare the performance of our proposed method RMFCC with those of the former channel compensation methods: CMS, two-level CMS and RASTA for robust telephone speech recognition. For all experiments, a Korean isolated 84-word-database consisting of 80 speakers collected from...
متن کاملMultivariate Cepstral Feature Compensation on Band-limited Data for Robust Speech Recognition
This paper describes a new method for compensating bandwidth mismatch for automatic speech recognition using multivariate linear combinations of feature vector components. It is shown that multivariate compensation is superior to methods based on linear compensations of individual features. Performance is evaluated on a real microphone-telephone mismatch condition (this involves noise compensat...
متن کامل